REMARKS 



Claims 1-18 were pending in the patent application. 
By this amendment, Applicants cancel dependent Claims 2, 8 
and 14 and add new dependent Claims 19, 20 and 21. No 
additional filing fee is reguired. 

The Examiner has rejected Claims 13-18 under 35 USC 
112 as not supported by the Specification. Applicants 
respectfully assert that the original Claims 13 and 14 
described the program storage device for implementing the 
inventive method. One having skill in the relevant art 
would have understood that the inventors were in possession 
of the claimed invention at the time of application filing 
and would fully understand how to implement the program 
storage device from the description. By a previous 

amendment, Applicants amended the Specification to include 
a description of the program storage device on page 4. 
Applicants again amend the Specification on page 2. The 
amendment language finds support in the original Claims 13- 
14. Applicants reiterate that the language of the original 
claims 13 and 14 was sufficient to reasonably convey to one 
skilled in the relevant are that the inventors had 
possession of the claimed invention at the time the 
application was filed, in accordance with 35 USC 112, first 
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paragraph. Applicants request reconsideration of the 

rejection of the claims directed to the program storage 
device for storing a program of instructions executable by 
a computer for implementing the inventive method. 

The Examiner has rejected Claims 1-18 under 35 USC 
102(e) as anticipated by Sinai. For the reasons set forth 
below, Applicants respectfully assert that all of the 
pending claims are definite and patentable over the cited 
prior art. 

The present application teaches and claims a system 
for a user to input commands to generate a voice XML file, 
including directing how the audio content is to be 
delivered and inserting hyperlinks to linkable audio 
content. The system includes a graphic user interface for 
defining a plurality of first and second icons, wherein 
each of the first icons corresponds to one or more 
attributes of a voice XML and at least one second icon 
corresponds to a hyperlink to a linkable voice XML file. 
The system further includes a voice XML tag generator for 
interpreting the action stream of a user based on a library 
of voice XML tags and generating the corresponding voice 
XML tags and a voice XML file generator 103 for combining 
the contents to be played with the tags generated by the 
voice XML tag generator according to voice XML syntax for 
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creating the voice XML file including hyperlinks to XML 
voice files. Under the present invention, the user is 
dynamically creating a voice file for which a listener can 
provide hyperlink input to automatically invoke rendering 
of audio content from the linked XML voice file. 

The independent claims of the application have been 
amended to more clearly recite that the user is selecting 
icons for an existing content stream (page 6, lines 10-15) , 
that first and second icons can be selected (page 5, lines 
25-31), that icons are selected to add hyperlinks (page 4, 
line 9 and page 8, lines 5-19 and in Fig. 7) and that a 
voice XML is generated with both tags and hyperlinks where, 
upon listener hyperlink input to the generated voice XML 
file, audio accessed through said hyperlink is 
automatically delivered to the listener (page 4, lines 20- 
26 and page 8, lines 10-12) . 

Dependent Claims 2, 8 and 14 have been canceled. 
Accordingly, the dependency of several corresponding claims 
(Claims 3 and 4 which previously depended from Claim 2 ; 
Claims 9 and 10 which previously depended from Claim 8; and 
Claims 15 and 16 which previously depended from Claim 14) 
have been changed to appropriately depend from the amended 
independent claims. The language of those claims has also 
been amended to correspond with the amended language of the 
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independent claims from which they depend and upon which 
they rely for antecedent basis. Further, Claims 19-21 have 
been added to expressly recite the provision of the 
<bargein+true> prompt to allow a listener to interrupt 
display of the generated voice XML file in order to invoke 
a hyperlink for display of a linkable voice XML file (page 
6, line 28 and page 7, line 9) . No new matter is added by 
any of the claim amendments. 

The Examiner has cited the Sinai patent as 
anticipating all of Claims 1-18. The Sinai patent is 
directed to a system and method for a developer to create 
an audio dialog file with which a user can interact. The 
developer graphically defines a dialog flow comprising a 
sequence of speech objects in order to "pre-record" voice 
files that can be played to a user in a voice response 
system. The developer selects and drags speech objects 
from a speech object palette and drops them into a display 
grid (col. 9, lines 18-23). The developer places the 
speech objects in the grid in a preferred order that 
corresponds to the voice response interaction that is being 
developed. A VXML writer generates VXML representing the 
current state of grid and stores the dialog flow (Col. 10, 
lines 57-62) . The VXML is simply a concatenation of the 
pre-recorded speech corresponding to the selected speech 
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objects. The created dialog flow may be used in 

conjunction with speech recognition software to conduct an 
interactive dialog with a user, by prompting input from the 
user, interpreting user responses, and then selecting the 
appropriate next response in the dialog flow based on the 
user response. 

Applicants acknowledge that Sinai teaches a tool for 
graphically defining a dialog flow. However, Applicants 
disagree with the Examiner's conclusion that Sinai's system 
and method for graphically defining a dialog flow 
anticipates the invention as claimed. With specific 

reference to the claim language, the present application 
teaches and claims a system, method and program storage 
device for a computer network user to create a voice XML 
file from a TTS file or a read-time recorded audio file 
automatically. While the present invention dynamically 
creates XML tags and dynamically generates the XML file 
from user input, the Sinai system simply retrieves pre- 
recorded XML speech for selected speech objects. 

As recited in each of the pending claims, the present 
invention provides a graphic user interface (GUI) for 
defining a plurality of first and second icons, each of 
said first icons corresponding to one or more attributes of 
voice XML with at least one second icon corresponding to a 
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hyperlink to a linkable voice XML file. Sinai simply 
provides speech objects in a palette for selection by a 
developer (Fig. 4) . 

The claimed invention receives user selection input of 
icons to edit a content stream displayed in the GUI to 
customize audio output of the content stream and to add one 
or more hyperlinks to one or more linkable voice XML files. 
Sinai simply allows the developer to select speech objects 
in a desired order to create a dialog flow. 

The present invention records an action stream of a 
user invoking icons in the graphic user interface. 
Applicants acknowledge that the Sinai system continues to 
display the speech objects dragged and dropped onto the 
grid (Col. 10, lines 57-58), but maintains that Sinai does 
not expressly teach that the developer selection of speech 
objects is recorded. Under the Sinai teachings, the 
developer selects the speech objects and the "drag-and-drop 
interpreter 122 receives the drag-and-drop-type user inputs 
directed to the toolbox 121 and the grid 123" (Col. 10, 
lines 55-57. What Sinai describes is standard GUI input 
processing, which is not the same as or suggestive of 
recording an action stream 

With respect to the claim feature of steps and means 
for interpreting the action stream based on a library of 
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voice XML tags and generating voice XML tags for the 
content, Applicants respectfully assert that the Sinai 
patent does not teach recording an action stream and does 
not teach interpreting the action stream to generate voice 
XML tags. Applicants reiterate that the Sinai system 
simply retrieves pre-recorded information for selected 
speech objects. Sinai does not dynamically generate XML 
tags base on user's input interpreted from an action 
stream. 

With respect to the claim feature of steps and means 
for generating a voice XML file by combining the generated 
voice XML tags and the content stream including at least 
one hyperlink to a linkable voice XML file, Applicants 
contend that Sinai simply concatenates pre-recorded 
information associated with the selected speech objects. 
Sinai does not dynamically generate a voice XML file by 
combining newly generated XML tags with a content stream. 
Moreover, Sinai does not teach or suggest incorporating a 
hyperlink in a rendered voice XML file. While Sinai does 
teach that, during running of a developed dialog, a user's 
input may be used to access a Web page to return a stock 
quote or the like (Col. 11, lines 40-42), that is not the 
same as or suggestive of incorporating selectable 
hyperlinks into a dynamically generated voice XML file. 
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As recited in the claims as amended, the voice XML 
file generated by the present invention includes at least 
one hyperlink whereby, upon receipt of listener hyperlink 
input to the generated voice XML file, audio accessed 
through said hyperlink is automatically delivered to the 
listener. Sinai teaches no such hyperlink. 

Applicants respectfully contend that Sinai is not 
teaching or suggesting such a graphical user interface 
component or functionality as claimed. Further, there is 
nothing in the Sinai patent which teaches or suggests the 
steps and means for automatically adding one or more audio 
hyperlinks for a voice file (Claims 3-6, 9-12 and 15-18) . 
Moreover, there is nothing in the Sinai patent which 
teaches or suggests inclusion of an XML tag for allowing a 
user to "barge in" to interrupt a rendered audio stream to 
invoke an audio hyperlink for automatic retrieval and 
delivery of a linkable voice XML file. 

For a patent to anticipate another invention under 35 
USC § 102 (e) , the patent must clearly teach each and every 
claimed feature of the anticipated invention. Since the 
Sinai patent clearly does not teach the features of the 
system, method or program storage device as claimed, namely 
the steps and means for providing a graphic user interface 
for defining a plurality of first and second icons; 
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receiving user selection input of the icons to edit GUI- 
displayed content and add at least one hyperlink thereto; 
recording an action stream of a user invoking the icons in 
the graphic user interface; and interpreting the action 
stream to dynamically create a user voice XML file with at 
least one hyperlink, it cannot be maintained that the Sinai 
patent anticipates each and every claim feature of the 
independent claims, Claims 1, 7 and 13, or of the remaining 
claims which depend therefrom and add further limitations 
thereto . 

Based on the foregoing amendments and remarks, 
Applicants respectfully request entry of the amendments, 
reconsideration and withdrawal of the 112 rejections, 
reconsideration and withdrawal of the 102 rejections based 
on Sinai, and issuance of the claims. 

Respectfully submitted, 
J. T. C. Yeh, et al 

By: /Anne Vachon Dougherty/ 
Anne Vachon Dougherty 
Registration No. 30,374 
Tel. (914) 962-5910 
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